Improving Dimensionality Reduction Projections for Data Visualization
نویسندگان
چکیده
In data science and visualization, dimensionality reduction techniques have been extensively employed for exploring large datasets. These involve the transformation of high-dimensional into reduced versions, typically in 2D, with aim preserving significant properties from original data. Many algorithms exist, nonlinear approaches such as t-SNE (t-Distributed Stochastic Neighbor Embedding) UMAP (Uniform Manifold Approximation Projection) gained popularity field information visualization. this paper, we introduce a simple yet powerful manipulation vector datasets that modifies their values based on weight frequencies. This technique significantly improves results across various scenarios. To demonstrate efficacy our methodology, conduct an analysis collection well-known labeled The improved clustering performance when attempting to classify space. Our proposal presents comprehensive adaptable approach enhance outcomes visual exploration.
منابع مشابه
Dimensionality Reduction for Data Visualization
Dimensionality reduction is one of the basic operations in the toolbox of data-analysts and designers of machine learning and pattern recognition systems. Given a large set of measured variables but few observations, an obvious idea is to reduce the degrees of freedom in the measurements by representing them with a smaller set of more “condensed” variables. Another reason for reducing the dimen...
متن کاملDimensionality reduction for financial data visualization
Various data mining methods are used for examining large financial data sets to uncover hidden and useful information. Ability to access big data sources raises new challenges related with capabilities to handle such enormous amounts of data. This research focuses on big financial data visualization that is based on dimensionality reduction methods. We use data set that contains financial ratio...
متن کاملRandom Projections for Dimensionality Reduction in ICA
In this paper we present a technique to speed up ICA based on the idea of reducing the dimensionality of the data set preserving the quality of the results. In particular we refer to FastICA algorithm which uses the Kurtosis as statistical property to be maximized. By performing a particular Johnson-Lindenstrauss like projection of the data set, we find the minimum dimensionality reduction rate...
متن کاملDimensionality reduction for visualization of normal and pathological speech data
For an adequate analysis of pathological speech signals, a sizeable number of parameters is required, such as those related to jitter, shimmer and noise content. Often this kind of high-dimensional signal representation is difficult tounderstand, even for expert voice therapists andphysicians.Data visualization of a high-dimensional dataset can provide a useful first step in its exploratory dat...
متن کاملInformation Retrieval Perspective to Nonlinear Dimensionality Reduction for Data Visualization
Nonlinear dimensionality reduction methods are often used to visualize high-dimensional data, although the existing methods have been designed for other related tasks such as manifold learning. It has been difficult to assess the quality of visualizations since the task has not been well-defined. We give a rigorous definition for a specific visualization task, resulting in quantifiable goodness...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13179967